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Abstract. We follow [18|25| and give a series of improved results for 
the Constrained Fault-Tolerant Resource Allocation (FTRA) problem. 
In FTRA, we are given a set of sites containing facilities as resources, 
and a set of clients consuming these resources. Specifically, each site i is 
allowed to open at most Ri facilities with cost /; for each opened facility. 
Each client j requires an allocation of rj facilities and connecting j to 
any facility at site i incurs a connection cost dj . The goal is to minimize 
the total cost of this resource allocation scenario. 

FTRA generalizes the Unconstrained Fault-Tolerant Resource Allo- 
cation (FTRAoo) |18| and the classical Fault-Tolerant Facility Location 
(FTFL) [13] problems: for every site i, FTRAoo does not have the con- 
straint Ri, whereas FTFL sets R4 = 1. These problems are said to be 
uniform if all r/s are the same, and general otherwise. 

In this paper, we strive to close the gap between FTRA and FTFL 
from the perspective of approximation. For the general metric FTRA, we 
first give an LP-rounding algorithm achieving the approximation ratio 
of 4. Then we show its reduction to FTFL, implying the ratio of 1.7245. 
For the uniform FTRA, we provide a 1.52-approximation primal-dual 
algorithm in O (n 4 ) , where n is the total number of sites and clients. 
For completeness in the line of |25| for FTFL, we also consider the 
Constrained Fault- Tolerant fc-Resource Allocation (KFTRA) problem 
where additionally the total number of facilities can be opened across the 
sites is at most k. For the uniform KFTRA, we give the first constant- 
factor approximation algorithm with a factor of 4. Note that the above 
results carry over to FTRAoo and K FTRAoo ■ 



1 Introduction 

In the Constrained Fault- Tolerant Resource Allocation (FTRA) problem intro- 
duced in 1 18], we are given a set T of sites and a set C of clients, where = nj, 

* A small part of the Section 5 of this paper has appeared in |18j . 



2 Kewen Liao, Hong Shen and Longkun Guo 



\C\ = n c and n — nt + n c . Each site i G J contains at most Ri (Ri > 1) facilities 
to open as resources and each client j 6 C is required to be allocated rj (rj > 1) 
different facilities. Note that in FTRA, facilities at the same site are different 
and maxjgc r j < 12iej^^i- Moreover, opening a facility at site i incurs a cost 
fi and connecting j to any facility at i costs Cij . The objective of the problem 
is to minimize the sum of facility opening and client connection costs under 
the resource constraint Ri. This problem is closely related to the Unconstrained 
Fault- Tolerant Resource Allocation (FTRA^, |18| , the classical Fault- Tolerant 
Facility Location (FTFL) [13 and Uncapacitated Facility Location (UFL) [23 
problems. Both FTRA^ and FTFL are special cases of FTRA. In FTRA^, 
Ri is not specified, whereas Vi e T : Ri = 1 for FTFL. These problems are 
uniform if all r^'s are same, otherwise they are general. If Vj £ C : rj = 1, 
they all reduce to UFL. Fig. 1 displays an FTRA instance with a feasible so- 
lution. We notice that both FTRA and FTRAoo have potential applications in 
today's distributed and high performance computing (DHPC) systems such as 
the cloud computing. The fault-tolerance attribute can be also viewed as the 
parallel processing capability of these systems. Unless elsewhere specified, we 
consider the problems in metric space, that is, the connection costs Cy's satisfy 
the metric properties like the triangle inequality and etc. Note that even the 
simplest non-metric UFL is hard to approximate better than O (log n) unless 
NP C DTI ME [ n O(iogio g n)j j^g 



| open facility □ closed facility O clients 

Bi = 3 R 2 = 2 i? 3 = 4 




ri =2 r 2 = 3 1-3 = 3 n = 4 



Fig. 1. An FTRA instance with a feasible solution 



Related Work. Primal-dual and LP-rounding are two promising approaches 
in designing approximation algorithms for the facility location problems. Starting 
from the most basic and extensively studied UFL problem, there are JV |14j . 
MMS [21] and JMS p2j primal-dual algorithms obtaining approximation ratios 
of 3, 1.861 and 1.61 respectively. In addition, Charikar and Guha [4 improved 
the result of the JV algorithm to 1.853 and Mahdian et al. improved that 
of the JMS algorithm to 1.52, both using the standard cost scaling and greedy 
augmentation techniques. Shmoys et al. [23 first gave a filtering based LP- 

1 The problem is also called Fault- Tolerant Facility Allocation (FT FA) in |26| and 
Fault- Tolerant Facility Placement (FTFP) in [27]. Nevertheless, we reserve our 
name for identifying the different application-oriented resource allocation scenario. 
Our naming convention also follows from [6110115] for the set cover problems. 
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rounding algorithm achieving the constant ratio of 3.16. Following this, Guha and 
Khuller |7J improved the factor to 2.41 with greedy augmentation. Later, Chudak 
and Shmoys [5] came up with the clustered randomized rounding algorithm 
which further reduces the ratio to 1.736. Based on their algorithm, Sviridenko 
|24| applied pipage rounding to obtain 1.582-approximation. Byrka [1 achieved 
the ratio of 1.5 using a bi-factor result of the JMS algorithm. Recently, Li's more 
refined analysis in [17] obtained the current best ratio of 1.488, which is close to 
the 1.463 lower bound established by Guha and Khuller [7J. 

Comparing to UFL, FTFL seems more difficult to approximate. For the 
general FTFL, the primal-dual algorithm in [13] yields a non-constant factor 
O (logn). Constant results exist only for the uniform case. In particular, Jain et 
al. |11I20| showed their MMS and JMS algorithms for UFL can be adapted to 
the uniform FTFL while preserving the ratios of 1.861 and 1.61 respectively. 
Swamy and Shmoys [IF] improved the result to 1.52. On the other hand, LP- 
rounding approaches are more successful for the general FTFL. Guha et al. [8] 
obtained the first constant factor algorithm with the ratio of 2.408. Later, this 
was improved to 2.076 by Swamy and Shmoys [25] with several rounding tech- 
niques. Recently, Byrka et al. |3J used dependent rounding and laminar clustering 
techniques to get the current best ratio of 1.7245. 

FTRAoo was first introduced by Xu and Shen [25] and they claimed a 1.861 
approximation algorithm which runs in pseudo-polynomial time for the general 
case. Liao and Shen [IB] studied the uniform case of the problem and presented 
a 1.52 approximation algorithm using a star-greedy approach. The general case 
of the problem was also studied by Yan and Chrobak [27 who gave a 3.16- 
approximation LP-rounding algorithm based on [23 5 J , and recently claimed the 
ratio of 1.575 [29 built on the work of |5I1I2I8| . They aim to close the approxima- 
tion gap between FTRAoo and UFL. On the other hand, due to the difficulties 
inherited from FTFL and FTRAoo , it is still unknown what the approximation 
gap between FTRA and FTFL is. 

In this paper, we strive to close this gap. However, there are several difficul- 
ties. First, despite the similar combinatorial structures of FTRAoo and FTRA, 
the existing LP-rounding algorithms [27 29_ for FTRAoo can not be adapted 
for FTRA. The main reason is that the constraint Ri in FTRA makes these 
algorithms produce infeasible solutions. In particular, the most recent work of 
[29 requires liberally splitting facilities and randomly opening them. This can 
not be done for both FTRA and FTFL as the splitting may cause more than 
Ri facilities to open, which is not a problem for FTRAoo- Second, in FTFL, 
maxj e c r j < n fi while rj can be much larger than n$ in both FTRAoo and 
FTRA. Therefore, the naive reduction idea of splitting the sites of an FTRA 
instance and then restrict each site to have at most one facility will create an 
equivalent FTFL instance with a possibly exponential size. Third, significantly 
more insights and heuristics are needed in addition to the previous work for 
solving FTRA (both the general and the uniform cases) in polynomial time. 

Our Contribution. For the general FTRA, we develop a unified LP-rounding 
algorithm through modifying the 4-approximation LP-rounding algorithm [25 
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for FTFL. The unified algorithm can directly solve FTRA, FTRA^ and FTFL 
with the same approximation ratio of 4. This is achieved by: 1) observing and 
proving some structured properties of the algorithm for directly rounding the 
optimal fractional solutions with values that might exceed one while ensuring 
the feasibility of the rounded solutions and the correctness of the algorithm; 2) 
exploiting the primal and dual complementary slackness conditions of the FTRA 
problem's LP formulation for achieving the ratio. Then we show FTRA reduces 
to FTFL using an instance shrinking technique inspired from the demand re- 
duction trick of [55] for FTRA^. It implies that these two problems share the 
same approximability in weakly polynomial time. Hence, from the FTFL result 
of [3], we obtain the ratio of 1.7245. For the non-metric FTRA, we get the first 
approximation factor of O (log 2 n) deduced from the results of |13|9j . 

For the uniform FTRA, we first present a naive primal-dual algorithm which 
runs in pseudo-polynomial time. In order to analyze it, we subsume dual fitting 
fTT] and inverse dual fitting |26I19| analysis approaches into a simple and sys- 
tematic constraint-based analysis to derive the ratio of 1.61. Later, with a speed- 
up heuristic applied to the primal-dual algorithm, we obtain the first strongly 
polynomial time algorithm for FTRA having the same ratio of 1.61 but with 
runtime O (n 4 ) . Moreover, with another similar heuristic applied to the greedy 
augmentation technique jS], the 3.16-approximation rounding result of [27] for 
the general FTRAoo is improved to 2.408, and the previous 1.61 ratio for the 
uniform FTRA reduces to 1.52. 

Lastly, we consider the Constrained Fault-Tolerant fc-Resource Allocation 
{KFTRA) problem by adding an extra constraint that at most k facilities 
across the sites can be opened as resources. For the uniform KFTRA, we give 
a 4-approximation algorithm based on the work of [14125] . In particular, the 
algorithm relies on a polynomial time greedy pairing procedure we develop for 
efficiently splitting sites into paired and unpaired facilities. 

Note that the results shown in the paper directly hold for FTRAqo and 
KFTRAoo, and the techniques developed will be useful for other variants of 
the resource allocation problems. For ease of analysis and implementation, the 
algorithms presented mostly follow the pseudocode style. Furthermore, we dis- 
tinguish among pseudo-, weakly and strongly polynomial time algorithms w.r.t. 
the problem size n. 

2 LP Basics and Properties 

The FTRA problem has the following ILP formulation [18_, in which solution 
variable yi denotes the number of facilities to open at site i, and Xij the number 
of connections between client j and site i. 

minimize £\ £jr f m + ]T\ g ^ £ j6 c c l3 x l3 
subject to Vj G C : J^igjF iy > Tj 

Vi€T,j G C : yi-Xij > (1) 

Vi G T : yi < Ri 

Vl G T,j G C : Xij, yi G Z + 
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The problem's LP-relaxation (primal LP) and dual LP are the following: 

minimize £\ £jr f*V* + E i6 ^ E j6 c C 'J X ^J 
subject to V j G C : %ij > J*j 

Vi € 7,j eC = Vi -Xij > (2) 

Vi G -F : yi < Ri 

VieJj'eC: yi > 

maximize J] j6C r^ay - RiZi 

subject to Vi G 7 : J2 je c ^ /« + 2i (^i 
Vi G J", j G C : dj - fiij < Cij 
Vi G 7,j G C : a j: fa, Zi > 

From the formulation, it is easy to see that the problem becomes FTFL if 
Vi € J 7 : iZf = 1, and FTRAqo if the third resource constraint of the ILP is 
removed. Now if we let (a;*, y*) and (a*, /3*, z*) be the optimal fractional pri- 
mal and dual solutions, and cost (x, y) and cost (a, /3, 2) be the cost functions 
(objective value functions) of any primal and dual solutions respectively. By the 
strong duality theorem, cost (x* , y*) = cost (a*, (3*, z*). Moreover, the primal 
complementary slackness conditions (CSCs) are: 

(CI) If x* 3 > then a* = /3£ + c„ . 

(C2) If y* >0 then £ ieC ft. = ft + 

Dual CSCs are: 

(C3) If a* > then £ ie ^« = r,-. 
(C4) If ^ > then x* tJ = y* . 
(C5) If z* > then y* = 

W.l.o.g., (sc*, y*) and (a*, /3*, z*) have the following properties: 
(PI) VjeC; a* > and £^ x*. = r,, 

(P2) (a;*, y*) is 'almost' complete, i.e. Vj G C : if i*- > then a;^- = y* (the 
complete condition) or there is at most one i s.t. < a;*,- < y* where i is the 
farthest site connecting j. (cf. [5 25 29_ for more details) 



3 A Unified LP-Rounding Algorithm 

The algorithm ULPR (Algorithm 1) starts by solving the primal and dual LPs 
to get the optimal solutions (x*, y*) and (a*, /3*, z*) to work with. In order to 
utilize the dual LP (LP |3|) for analyzing the approximation ratio of the output 
solution (x, y), first of all, we encounter the difficulty (cf. LemmaJIJ of bounding 
the — ^2 i€ jr RiZi term in the dual objective function. To resolve this, we exploit 
the dual CSC (C5). The condition guides us to modify the 4-approximation 
algorithm of [25;, and come up with Stage 1 of the algorithm ULPR which fully 
opens all (facilities of) sites with y* — R t and put these sites into the set V 
for pruning in future. Moreover, for successfully deriving the bound stated in 
Lemma [TJ in the algorithm the client connections x*j with the opened sites in 
V are rounded up to \x*A ; in the analysis the other primal and dual CSCs are 
also exploited. At the end of Stage 1, for each j, we calculate its established 
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connection fj, residual connection requirement fj and record its connected sites 
not in V as set Tj for the use of next stage. 



Algorithm 1 ULPR: Unified LP-Rounding Algorithm 
Input: F, C, f, c, r, R. Output: (x, y) 

Initialization: Solve LPs |2]| and |3| to obtain the optimal fractional solutions 
(as*, y*) and (a*, f3* , z*). x ^ 0, y ^ 0, V ^ 0) 
Stage 1: Pruning and Rounding 
for 2 G F 

if y* = Ri do 
y% Ri 

for j G C 



set Vj G C : fj J2 ie v X H > r i *~ r i ~ r 'i > ?i *~ { l e -^V I **j > °} 

Stage 2: Clustered Rounding 
set CV{jeC r, > 1} 
while C # 

/*2.1: Construct a cluster S centered at j : directly use the SI and S2 steps 
for FTFL (cf. page 6 of [53]) and tailor them for FTRA*/ 
jo "s— argminj {a*j : j G C}, order .7-"^ by non-decreasing site facility costs 
choose S C jr, o starting from the cheapest site in Tj a s.t. just E;es 2/< — r L 
if E ieS 2/i > r 7o do 

split the last most expensive site ii £ S into i\ and «2: = r] — Eigs\i, 2/? j 

U%2 Vi\ Vit' f*-*rall j. Set X^j, %i 2 j S.t. 3?^* ~t~ %i 2 j X iiji X i\j — Vi\ 

x* 2 j < yt 2 and update Fj\ S <- S\ {it} U {ij (now J2 ieS Vi = r Jo) 

//2.2: Rounding around j D and 5 

//2.2.1: Finish rounding y 

for i G 5 / / from the cheapest site 

S/i <- \yX\ 

5<-SU {i} / /maintain a set of already rounded sites 

if Ei/ e sl/i'_> r L 

2/i <- r Jo - Ei'esViJ/i' (resetting y t to make Ei'eS^ = r jo) 
break 

/ / '2.2.2: Finish rounding cc 
for j 6 C / /including j 

if Jj-ns/ 0_ 

for i G 5 //order does not matter, could connect to the closest 
Xij <- min (fj, yi) 



The second major difficulty is that many LP-rounding algorithms round op- 
timal solutions with values in [0, 1], whereas for FTRA, our approach is unified 



if x*i > do 




=jFi\S 
update C 
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and general which directly rounds the solutions which might exceed 1 and later 
we shall analyze its correctness via establishing some useful properties. Stage 2 
is inherited from the classical iterative clustering idea [5 for UFL. Similarly, our 
clustering and rounding terminate when all fj's are satisfied, i.e. the set of not- 
fully-connected clients C = in the algorithm. Stage 2 consists of two substages 
2.1 and 2.2, dealing with cluster construction and cluster guided rounding re- 
spectively. As pointed out in the algorithm, Stage 2.1 adopts the splitting idea of 
[25 for FTFL and tailors it for FTRA. In each iteration, it first picks the cluster 
center j a with the smallest optimal dual value, and then builds a cluster S around 
it which contains a subset of ordered sites in Tj a , starting from the cheapest site 
until X^ies Vi — r i°- or der to maintain the invariant Mj G C : ^2 i€ jr. y* > fj 
in every iteration, the stage then splits the last site ii € S into i\ and «2, updates 
the client connections w.r.t. i\ and i 2 , and in 5 includes i\ while excluding i\ to 
keep J^ies yl = r lo ■ Stage 2.2 does the final rounding steps around S in addition 
to Stage 1 to produce a feasible integral solution (as, y). This stage modifies and 
generalizes the rounding steps for FTFL. Its substage 2.2.1 rounds up the sites 
(y* — > \y*~\) from the cheapest site in S until S (the set of sites rounded so far) 
just satisfies Xj'eS f*' — r ~io ( now these y^'s are already integral). To make sure 
Si'gs^* = T ~3o f° r bounding the site facility opening cost (cf. Lemma |2j), the 
integral facility opening t/j of the last site i in S is reset to fj o — X^'ESU^i'' 
which is also integral. After the facilities at the sites in S are opened according 
to the yi's constructed in stage 2.2.1, stage 2.2.2 then connects every client j in 
C which has connections to the sites in S (according to the x*j 's) to min (fj , f] o ) 
of these open facilities. It does this by iterating through all sites in S, setting 
Xjj's and updating fj's as described in the algorithm. At the end, for the run 
of next iteration, the sites in the cluster S are excluded from Fj. implying all 
clusters chosen in the iterations are disjoint; and C is updated (at least j is 
removed from the set). 

In the analysis, we first demonstrate the overall correctness of the algorithm 
ensured by the following properties. Note that some of the proofs in this section 
frequently refer to the content of Section 2. 

(P3) After Stage 1, Mi G V , j € C : x %j < R % and fj = Tj - fj > 0. 

Proof. The first part of the property is obvious since Xij = or Xij = \x*j] < 
Ty^l — R*' F° r the second part, MjEC: if all x*j are integers, we are done. Now 
we only need to consider fs fractional x*j with V . By the previous property 
(P2), there is at most one fractional x*j with V because all y*'s in V are integers. 
Therefore, in Stage 1, at most one fractional x*j is rounded up which will not 
make fj exceed r,-. 

(P4) Stage 2.2.1 rounds (the optimal fractional opening of the last site i\ 
in S which is included in Stage 2.1) to at most [y* J . 

Proof. If yl is integral, the property clearly holds. Otherwise if y* is fractional, 
Case 1): if J^i'^sVi' = r lo before resetting the last site i in Stage 2.2.1, this 
i definitely appears before i\ in S because otherwise J2i'es \Vi'~\ wm excee d 
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rj a , therefore i\ is left unrounded; Case 2): If J^i'esVi' > r 3o> tne l* 3 * site i is 
possibly i\ and if it is then S = S, and from the algorithm we have rounded 
Vii = r ~3o - T,i>eS\h \V*A after resetting. If y h > [y* h \ we get T,i>eS\h \v*A + 
y*J < r ~3o which is not Possible since £ ieS y| = r~ jo = Y,i'eS\h V*v + V*h = 
Ei'esVx Vv + k*J ^ £<'es\ii r»*'l + k*J (because y? is fractional). Hence, 
y* x is rounded to at most [y^ j . 

(P5) Vi e J : given < y* + y* 2 = y* < R u then we have |y*J + \y* 2 ] < 

\vt] < Ri- 

Proof. We first have [yfj < i£ and [y*J < y* 2 + 1, so [j/*J + fe] < l£ + 
y* 2 + 1 = y* + 1. Now if y* is integral, because [y*J + [y* 2 ] is also integral, 
M + \y{] < m ■ Otherwise if y* is fractional, [y*J + fe] < Lw? J +1 = fo? 1 ■ 
The property then follows from Vi € J 7 : [~y*] < i?^. 

In summary, property (P3) shows the correctness of Stage 1 before going into 
Stage 2. (P4) and (P5) together ensure the splitting in Stage 2.1 and the rounding 
in Stage 2.2.1 produce feasible yj's for FTRA. This is because for any split sites 
i\ and ii from i, (P4) guarantees at i\ at most [y* J facilities are open, and (P5) 
makes sure that even |~y* 2 ] facilities are opened at i 2 in the subsequent iterations 
of the algorithm, no more than Ri facilities in total actually get opened at i. 
Note that, (P5) also covers the situation that a site is repeatedly (recursively) 
split. Furthermore, in each iteration, Stage 2 at least fully connects the client 
j and considers all sites in the cluster S centered at j . More importantly, the 
invariant Vj G C : £ ie jr. Vi — is maintained for choosing the feasible cluster 
S in Stage 2.1. This is true in the first iteration. In the subsequent iterations, 
the invariant still preserves because for any j with Tj fl <S ^ and is not fully 
connected in the current iteration, in the next iteration, Eiejr Vi decreases by 
at most f] o (because Stage 2.1 splits sites to maintain Xaes Vi = r lo anc i $ is 
excluded from Tj in Stage 2.2.2) and fj decreases by exactly rj o (from Stage 
2.2.2). Therefore, the overall algorithm is correct. 

Furthermore, the time complexity of the rounding stages of Algorithm 1 is 
O (n 3 ) since each iteration of Stage 2 at least fully connects one of n c clients 
which takes time O (n 2 ). In the following, we separately bound the partial so- 
lution costs incurred in the stages involving rounding and then combine these 
costs for achieving the approximation ratio. 

Lemma 1. After pruning and rounding, the partial total cost from Stage 1 is 
Ejec r ^ a j ~ ^ieJ 7 R iZ i ■ 
Proof. MieV : 

E jec Nil a *j = E K-l + E &3 

3ec j ec 

= EKK + E E K-K- 
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= E \ x *i\ c v + E K-l % 

= EK-K + i M E /% + E /%) 

= EKl c «+^E^ 
= E K-l c ^ + ^ + ^ 

The first equality is due to the condition (CI), the third, fourth and fifth 
is because by (C4) we have Vz e V, j E C : if > then x* i} = y* = R l , 
so x*j < y* — Ri implies /3*- = (by the contraposition in logic) and also 
J2j- x *.<y*=Ri Ptj — 0- The last equality is obtained from (C2), and the fact that 
Vz e V : y* =Ri> 0. 

Summing both sides over all i e V, we can then bound the cost of Stage 1: 



E ie v Ej-gc Kl c ^ + fii/i = E E Kl a j - E ^ 

iev jec ier 

= E^-E i? ^*- 

The second equality follows from the Stage 1 Vj G C : r,- = Eie-p [^ij] i an d 
the condition (C5): if z* > then y* = Ri, so y| < Ri implies z* = . 

Lemma 2. After finishing rounding y, the partial site facility opening cost from 
Stage 2.2.1 is at most J2ier\v fiVi- 

Proof. Facilities at sites ieSC T\P are opened in /j's non-decreasing order 
in Stage 2.2.1: In any iteration of the algorithm with picked cluster S, before 
rounding we have £\ 5 y* = r~j o ; after rounding set S is formed starting from 
the cheapest site in S s.t. X^'es?^ = r ~ja- This makes the opening cost of all 
sites in cluster S at most ^2 iG g fill*- The lemma then follows from the fact that 
all chosen clusters are disjoint in the algorithm. 

Lemma 3. After finishing rounding x, the partial connection cost from Stage 
2.2.2 is at most 3^ - £C rja*. 

Proof. Let site i lie in the cluster S centered at j a . If j is already connected to 
i (x*j > 0), then ctj < a* from the condition (CI). Otherwise, if j connects to 
i after rounding, from the algorithm, it implies a* o < a* (because j with the 
smallest a* is always chosen as j ) and Tj n <S ^ 0. Fig. 2 then displays the case 
T 3 ; n S 7^ where initially j connects to i' and it is connected to i after rounding. 
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By the triangle inequality, we have cy < Cj/j + Cij a + Ci'j a ■ Also, it is true that 
x t'j> x ij i x i'j ^ ^' so f rom (CI) we have Cj/j < a* and Cjj o , Cj/j o < Hence, 
c ij < a j + < 3a*. Summing up both sides of this connection cost bound, 
we have Yliepyp Ejgc Ci J — ^ Ej £ e ^j a j anc ^ ^he fonima follows. Note that Fig. 
2 does not show multiplicity of the connection between any client and site in an 
FTRA solution. It is merely for simplicity and will not affect the correctness of 
the proof. 



S 




Fig. 2. Illustration of bounding the connection costs 



Theorem 1. Algorithm ULPR is J i -a r p r proximation for FTRA. 

Proof. Adding up the partial cost bounds stated in the previous lemmas, the to- 
tal cost cost (x, y) is therefore at most X^eC ^j a j ~ Eiejp Ri z t + Eie;F\p fiVi + 
: >y.,. e 'VV Also > we have cost(x*, y*) = J2^f t y* + T,ie^T,jee c ij x *j = 
cost (a*, [3*, z*) = J2 jeC r 3 a *j-T,ier R * z t = Ejec r >j+E 3GC r 3 a j-T,ieF R i 
so cost (x, y) < Y.jec Ri*t + (j2 je c r "j a j + Ej e c r l a j - E l£ ^ R%4 

3E ieC ^«i =^E j& cr j a* + 2^: jeC f j a*-2j: i ^R i z*=4cost(cx*, (3*, z*)- 
2 (j2j €C fja* — Ei&F-^^i) — 4cos£ (x* , y*). The last inequality follows from 

the fact that (j2j e c^3 a *j — Eie^^i^*) i s the cost of Stage 1 (cf. Lemma jlj 
which is nonnegative. 



4 Reduction to FTFL 

Recently, the authors in [55] proposed a demand reduction trick that reduces any 
FTRAoo instance with arbitrarily large rj to another small FTRAoo instance 
with polynomially bounded rj . The direct consequence of this is that FTRAoo is 
then reducible to FTFL, since we are able to naively split the sites of the small 
FTRAoo instance and the resulting instance is equivalent to an FTFL instance 
with a polynomial size. Because FTRA and FTRAoo have similar combinatorial 
structures, the question then becomes whether FTRA reduces to FTFL as well. 
In the following, we give an affirmative answer to this with the instance shrinking 
technique. 
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Compared to the reduction in |29| for FTRAoo, first, the instance shrinking 
technique for FTRA does not rely on the complete assumption of the optimal 
solution (x*, y*) of LP Q. Recall the property (P2) that (x*, y*) is complete 
when Vj E C : if x*- > then x* } = y*. The reason is that in FTRA, the 
number of facilities at site i is bounded by R4. This will cause inconvenience 
in splitting sites to construct the complete solution and then continuing the 
reduction. So rather we follow the similar settings in (28] for FTRAoo, which 
does not require (as*, y*) to be complete. Second, the technique is more general 
than the demand reduction. This is because the purpose of the technique changes 
to reducing any FTRA instance with arbitrarily large Ri to another small FTRA 
instance with polynomially bounded Rf, which works for FTRAoo as well since 
an FTRAoo instance can be treated as an FTRA instance with all R^s set 
to be maxj G c r j- The small FTRA instance is then equivalent to an FTFL 
instance with a polynomial size Q^e^-Hf), implying FTRA and FTFL share 
the same approximability in weakly polynomial time. Third, in order to bound 
Rf polynomially, the technique heavily relies on the following subtle observation 
for carrying out instance shrinking. 

Claim, (x* , y*) remains to be the optimal solution even if Ri is replaced with 
\yt\ in LP §. 

Proof. Denote the instance with parameter Ri as l a , and 1 after replacing Ri 
with \y*~\ . On one hand, solving T will not yield any better optimal solution 

^a;*, y*^J with cost (x* , y*^J < cost (x* , y*), because this ^a;*, y*^J is also fea- 
sible to 1 Q , which contradicts the optimality of (a;*, y*) for I a . On the other 
hand, cost (x* , y*^j > cost (x* , y*) is not possible since (a;*, y*) is also a fea- 
sible solution to T as y* < \y*~\, which contradicts the optimality of (x* , y*^J 
for X. Hence, (x* , y*) stays optimal for X. 

Now in the reduction phase of the instance shrinking technique, initially we can 
consider the equivalent FTRA instance X with V? E T : R L — \y*~\ and the same 
optimal solution (x* , y*). Then, (x* , y*) is split into a large integral solution 
with y\ = max(0, [y* \ — 1) and x\a = min([a;* J J , y\), and a small fractional 
solution with yf — y* — y\ and xfj = x*j — x\j, for all i e J, j e C. Let the 
tuple (F, C, /, c, r, R) represent the instance X, the reduction then proceeds 
by splitting X into a large instance X 1 : C, f, c, r , R ) and a small instance 
X s : (J 7 , C, f, c, r s , R s ) according to (x l , y') and (x s , y s ). In particular, these 
two instances differ at two parameters r and R, where we let r - — J^ie^-^ij' 
rj = rj — r l j and R\ — y\, i?f = \y*~\ — y\. Note that although the above splitting 
idea of the technique is inspired from the demand reduction for FTRAoo, the 
focus on splitting i?, is essentially different from reducing rj. Also, here we can 
see that the construction of the shrunken instance X with Ri = \y*~\ is crucial 
for bounding Rf, since if the original Ri is used, Rf can not be bounded and 
the technique will not work. In the following, the first lemma is mostly from 
the original splitting idea where we provide a simpler proof for it. The second is 
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directly from our instance shrinking and splitting on Ri. As shown later in the 
proof of Theorem[2] these lemmas are necessary for the approximation preserving 
reduction from X to I s . 

Lemma 4. (x l , y l ) is a feasible integral solution to I 1 and (x s , y s ) is a feasible 
fractional solution to I s . 

Proof. According to the LP Q , it is trivial to see the feasibility of the integral 
solution (x l , y l ). For the fractional solution (x s , y s ), since rj — Y^ieF x ij> r j = 

EiG^^ij) r j = r j - T \ and x tj = x *j ~ Ap we have r j = I2i&F x ij and the 
first constraint of the LP holds. Further, it is easy to see yf < i?f and we 

are left to show the second constraint Vi G J 7 , j G C : yf — xfj > holds. 

i.e. y* — y\ > x*j — min([x* 3 J , ?/■). Consider two cases: 1) y\ < [x*j\, then 

the inequality obviously follows from y* > x*^\ 2) y\ > [x*j\ , the inequality 

rhs — x*j — [x*j\ , and Ihs = y* — max (0, [y* \ — 1) after substituting y\. Now 

again consider two sub cases: 2.1) [^J > L then Ihs > 1 while rhs < 1, so 

Ihs > rhs and the inequality follows; 2.2) [^J < 1, then Ihs = y* , and since 

1 > y* > x*p \_x*j\ = and rhs = x*j, then the inequality follows. Overall, 

(xfj, yf) is a feasible solution. 

Lemma 5. For the instances X 1 and I s the following holds: 

(i) max ieC r\ < R\ and m&x jeC rj < Rf. 

(ii) Rf G {0, 1, 2}. 

Proof, (i) The previous lemma and the constraints of the LP Q together ensure 
the bounds that Vj G C : rj < E ie ^^j < Eie^J/i < 22^% and r j < 

(ii) We have Rf = [y*]-max(0, [y*\ — 1) after the substitution of y\. If 
[y*J > 1, then R\ = \y%\- [y* \ + 1 G {1, 2}. otherwise if [y* \ < 1. then Rf = 

r%*le{o,i}. 

Theorem 2. // there is a p- approximation polynomial-time algorithm A for 
FTRA with polynomially bounded Ri, then there is also a polynomial-time p- 
approximation algorithm A' for FTRA. 

Proof. We will describe such an algorithm A'. It first does the instance shrinking 
and splitting as described before for any instance I of FTRA. From (i) of Lemma 
[H] the split instances I 1 and I s are valid. From (ii). I s has polynomially bounded 
Rf. Note that if Rf = 0, we can safely remove this site i in I s , and set the 
solution Vj G C : a;",-, yf = when later combining it with l l . Then, A' uses A 
as a subroutine to solve I s to obtain a feasible integral solution yf). From 
Lemma |4j (x s , y s ) is feasible to I s so cost (cc s , y s ) < p ■ cost (x s , y s ). Finally, 
.4' combines (xfj, yf) with the readily available constructed integer solution 
( x ij> 2/0 f° r Because (a;L, y') is a feasible integral solution to Z z , then when 
combined, they form a feasible integral solution to I as rj + r j = rj and i?' + 
i?f = Ri = \y*~\ . The only thing left is to prove the combined solution from 
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A' is /^approximation, i.e., cost (x l , y l ) +cost(x s , y s ) < p-cost (x* , y*). This 
follows from cost (a;', y l ) + cost (x s , y s ) < cost [x l , y l ) + p ■ cost (x s , y s ) < 
p ■ [cost (x l , y l ) + cost (x s , y s )) and x l + x s = x* , y l + y s = y* . 

Corollary 1. FTRA is reducible to FTFL in weakly polynomial time. 

Proof. Any instance of FTRA with polynomially bounded Ri can be treated 
as an equivalent FTFL instance with the facility size X)ig_F Ri which is also 
polynomial. Then any polynomial time algorithm solves FTFL with ratio p can 
become the algorithm A for A 1 in the previous theorem to solve FTRA with 
the same ratio. In addition, the reduction requires solving the LP first to obtain 
(a;*, y*) which takes weakly polynomial time. 

Therefore, from the above corollary and the result of [3] for the metric FTFL, we 
get the ratio of 1.7245 for the metric FTRA. Also, from the results of [13,9], we 
can deduce that the non-metric FTFL has the approximation ratio of O (log 2 nj 
in strongly polynomial time. This is because Jain and Vazirani 1 13] proved that 
FTFL reduces to UFL with a ratio loss of O (log n) , and Hochbaum [5] showed 
that the non- metric UFL can be approximated with the ratio of O (logn). For 
the non-metric FTRA, we can achieve the same ratio in weakly polynomial time 
due to the reduction to FTFL first. Moreover, in future, any improved ratio for 
FTFL will directly hold for FTRA. 



5 The Uniform FTRA 

The reduction results in the previous section also imply that the uniform FTRA 
reduces to the uniform FTFL, so it achieves the ratio of 1.52 as in [25], but in 
weakly polynomial time. In this section, we show the same ratio can be obtained 
in strongly polynomial time without using the instance shrinking technique, 
but with a primal-dual algorithm and two speed-up heuristics. Note that the 
algorithms in the following work for the general FTRA as well. The uniform 



condition is only necessary in the analysis (Lemma 13). 

We begin with a naive primal-dual (PD) algorithm (Algorithm 2) for FTRA 
with an approximation ratio of 1.61 and then present the first speed-up heuris- 
tic to improve the complexity of the algorithm to strongly polynomial O (n 4 ) . 
W.l.o.g., the PD algorithm assumes that each client j makes rj connections 
and each connection is associated with a port of j denoted by (1 < q < rj). 
Also, the function <^(j^) represents the facility/site a client j's q-th port is 
connected with and the variable pj keeps track of the port of the client j to 
be connected. The algorithm then gradually connects clients in the port order 
from 1 to rj, as well as increasing the solution (x, y) from (0, 0) in its actions 
in response to some events controlled by a global time t that increases mono- 
tonically from 0. All events repeatedly occur until all clients are fully-connected, 
i.e., the not-fully-connected clients set U = 0. At any t, the payment of any 
client j to a site i is defined as t, and the contribution is max(0, t— c^j) for 

the clients in U and max ^0, max, c^/ . ( ,)^ — c^ for the clients in C\U. As t 
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increases, the action that a client j connects to a facility of i {x^j is increased 
by 1) happens under two events: Event 1. j's payment reaches the connection 
cost of an already opened facility at i that j is not connected to (implying 
at this time yi > x^); Event 2. sum of contributions of all clients to a closed 
facility at i reaches its opening cost /j. In particular, if yi < Ri, Event 2 triggers 
the action that a new facility at i is opened first (j/i is increased by 1). Then 
any client j G C\U with max, c 0(j(9))j — c ij > will switch one of its most 
expensive connections from argmax^^^ c ^,fjM\j to i; and the client in U with 
t — > will connect to i. In addition, for analyzing the approximation ratio 
of PD, each port is associated with a dual variable a* which is assigned the 
time t at which gets connected. From the algorithm, it should be obvious 
that cost(x, y) = 53, e cXa<«<r a< j- Note that Event 2 on i stops occurring 
once yi — Ri and this introduces some difficulties to the analysis. To tackle 
these difficulties, we use an extra variable x to store the numbers of the clients' 
connections when they just become fully-connected. 



Algorithm 2 PD: Primal-Dual Algorithm 



Input: T, C, f, c, r, R. Output: (x, y). 

Initialization: Set U — C, Vi G IF, j G C : x^, yi = 0, Pj = 1. 

while W / 8, increase time t uniformly and execute the events below: 

- Event 1: 3i G T , j G U: t — dj and Xij < yi. 

Action 1: set cj> (j( Pj ^ Xij <s— Xij+1 and a - 3 <s— t; lipj = rj, then U <s— U\ {j} 
and xij ~ Xij, otherwise pj <— pj + 1. 

- Event 2: 3i G T: E iGM max (0, t - cy) +E jeC \w max (°> max <? c 4,(jM)j ~ Ci i) = 
fi and yi < Ri. 



Action 2: set yi <— y» + 1; Vj G C\W s.t. max 



c. 



set ij 4- argmax^ j(s) j c 0(j(<?))j' <~ ^'j ~ X > ^3 <~ E »J + 1 and 

/ I arg maxg c ( (q) \ 

<j> jV ^l 3 W I <s— i; Vj G U s.t. i > Cy : do Action 1. 




Remark 1 // more than one event happen at time t, the algorithm processes all of 
them in an arbitrary order. Also, the events themselves may repeatedly happen at any 
t because more than one facilities at a site are allowed to open. 



Lemma 6. Algorithm PD computes a feasible solution to the uniform FTRA 
and runs in O (n 3 max^gc rj) . 

Proof. The solution is feasible because (a;, y) produced from PD is feasible to 
LP 0. Each iteration of PD at least connects a port of a client, so there are 
maximum ^Ziec r i iterations. In addition, similar to Theorem 22.4 of [16_ and 
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Theorem 8 of [14] for UFL, the client switching in Action 2 dominates the 
time complexity. In each iteration, the switching takes time O (n c rif) to update 
clients' contributions to other facilities for computing the anticipated times of 

the events. Hence, the total time is O fX}jeC r j n c n f^j > i- e - ^ ( n3 max jec r j) ■ 

For a facility location problem modeled by an LP, we observe that the analysis 
approaches like dual fitting [11 and inverse dual fitting |26I19| are both based on 
the constraints of the LP. In the following, we develop a simple and systematic 
constraint-based analysis together with the factor-revealing technique of [TT] to 
derive the ratio of 1.61 for FTRA. 

The PD algorithm produces a feasible primal solution (x, y) to FTRA but an in- 
feasible dual solution (a, (3, z) if we simply let otj — a- 3 . f3ij — max (0, ctj — Cij) 
and Zi = in LP This is because although the LP's second constraint 
holds, the first constraint fails to hold from the events of the algorithm. How- 
ever, through the steps of properly constructing the dual variables, considering 
relaxing the feasibilities of the constraints, and proving with the factor-revealing 
technique that the algorithm can actually ensure these relaxed constraints, the 
algorithm's approximation factor can be eventually derived. First, we bound the 
primal solution cost cost (x, y) with the dual solution cost cost (a, f3, z) using 
the following dual construction of a, z: 

!xl j (^ctj —&j J ^ 
iix"ij < Rj 

In the above setting, x stores the primary connection information of the 
clients after they become fully-connected but before they switch any of their 
connections; Vi £ J- . j G C : hj denotes the last port of j connecting to i before 
switching; aj 3 is the dual value of the port Uj and a? is the dual of j's last 
port. The other dual variable (3 is to be constructed later in the analysis. 

Lemma 7. cost(x, y) < cost (a, /3, z) where (x, y) is the feasible primal so- 
lution produced from the PD algorithm and (a, /3, z) is constructed from above. 



Proof. 



cost (a, f3, z) = S~] rjCtj — RiZi 



Y Y x v a 7 YY y - v - " , 





if x"ij<Ri 



Y Y x » a 7 YY ^ ("• - a ¥ ) 



jec i&T 



if x\ i= Ri 

Y Y x ^ a 7 ■ YY 



iGT jec 
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Hence, cost(x,y) = Ejec Ei< g <r 3 - "j ^ Y2J2 x "^ a 7 1 X! X! = 

v ' v v ' 

itxij<Ri ifx"ij— Ri 

cost (a, (3, z). 

Second, we exploit the dual constraints of the LP (|3|. Before going into this, we 
have the basic definition below. 

Definition 1. An algorithm is bi-factor (pf, p c ) or single factor max(p/, p c )- 
approximation for FTRA, iff for every instance I of FTRA and any feasible 
solution SOL (possibly fractional) of I with facility cost Fsol and connection 
cost CqoLi the total cost produced from the algorithm is at most PfFsoL + 
PcCsol (pf, Pc are both positive constants greater than or equal to one). 

In the definition, let any feasible solution be SOL = (x", y"), then Fsol = 

Ei G ^ fiV'U C sol = Eie^- Ejec c H x ij and cost ( x " v") = f sol+C S ol- In the 
following, we consider the feasibility relaxed dual constraints with the relaxation 
factors pf and p c : 

(C6) Vi G T , j G C : ctj — fiij < p c Cij. 
(C7) VieF: E j&C Pii<Pffi + z i- 

Next, we show that if the dual variables (a, (3, z) satisfy these relaxed con- 
straints, the corresponding dual cost will be bounded by any feasible primal cost 
scaled by the factors pf and p c . 

Lemma 8. If (a, (3, z) satisfy (C6) and (CI) while SOL = (x" , y") is any 
feasible primal solution, then cost (a, (3, z) < pfFsoL + PcCsol- 

Proof. Since (x" , y") is any feasible solution, all constraints of the LP Q should 
hold first. Together with (C6) and (C7), we have: 



cost (a, f3, z) = ^2 rjOij — Y, RiZi 

< Y Y [Pvv'i + ( q j ~ x 'io] - Y y'i Zi 

< Y (pff* + 2 «)!'i+EE Pecan"] - Y y '^ Zi 

= Y PffiVi + Y Y P cCi 3 x ij = Pf F soL + PcCsol 
The previous two lemmas and the definition immediately imply the next lemma. 



Lemma 9. The PD Algorithm is (pf, p c )- approximation if (ex, (3, z) satisfy 
(C6) and (CI). 
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In order to satisfy (C6), we set Vz e F, j E C : fcj = max (0, ctj — p c Cij) (because 
Pij > 0), thereby finishing constructing (a, /3, z). The last step is to find the 
factors pf and p c s.t. (C7) is also satisfied. The next lemma and corollary are 
the more specific forms of the previous lemma, after substituting the setting of 
(a, [3, z) into (C7). 

Lemma 10. The PD Algorithm is (pf, p c ) -approximation if Vi € F : 
J2jeAi ( a ? ~ PcCij - < Pffi where At = {j G C \ (// > p c Cij}- 

Proof. After the substitution, (C7) becomes: 

Vie F : J2 je c (Aj - fy) < 

Vie J 7 : J2jec ( max (°> a ? ~ PcCy) - 7Ty) < 

Vie F : ^2 - nj - PcCij) - TTy < 
j'ec jec 

if a] 3 > p c Cij if a. 3 <p c Cij 

Therefore, since 7Tjj > 0, it is sufficient to prove Mi e F : J2jeAi { a 7 ~ n ij ~ 
where A% = { j € C | a^ 3 > p c Cij} to satisfy the original (C7). 

If we set Vi e 7, j £ C : u,,- = a'- 3 — n—, then uu = (~ 3 r 13 % and we 

1 " 3 ifx'ij < l>, 



have the corollary below. 



Corollary 2. W.l.o.g., for every site i, order the corresponding Hi — \Ai\ clients 
in Ai = {j € C | ct 3 > p c Cij} s.t. un < . . . < Minj- T^e? 1 the PD Algorithm is 
(p f , p c )- approximation if Mi e F : YljLi i u ij - PcCij) < Pffi- 

In addition, for each i, any subset of the clients are ordered from 1 to fcj s.t. 
u n < • • • < Uiki - Now, we proceed the proof to find pf and p c with the following 
lemmas. These lemmas are needed for the factor-revealing technique and they 
capture the properties of the PD algorithm for the uniform FTRA. 

Lemma 11. For every site i, at time t = Uij — e, VI < h < j < fcj let uj l h ■ = 
I Uih ifxi h = Ri 

{ maX q C 4,(hM)h ifXih<Ri ' 3 ■ 3 + L 

Proof. If x'ih — Ri, then uj l h ■ = uj l h . +1 (at time t = u i ^ +1 - ) —e) = Uih- Otherwise, 
xlh < Ri implies u^h — ct~ h h , so h is fully-connected at time t since Uih < 
Therefore, oo l h > u l h +1 because a fully-connected client's ports always 



v 



reconnect to the sites with less connection cost, so its maximum connection cost 
will never increase. The lemma follows. 

Lemma 12. For any site i and ordered fcj clients, VI < j < ki : 

EC 1 ! max (o. u h j - Cihj + Y,h=j max (°' u v - ^ Si- 
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Proof. For any site i and at time t — Uij — e, if h < j client h's contribution 
is set to be max ^0, ui l h j — c^A . In particular, from the previous lemma and 
the setting of Uij, if x\h < Rii it implies h is fully-connected at time t and 
the contribution is max ^0, maXjC^^,)^ — c,/^ . In addition, if x\h = Ri the 

contribution is max ^0, a l £ h — Cj/jJ ■ Note that under this case, h still might be 
fully-connected at time t, but because x\h = Ri and following the algorithm, 
its contribution should not be set to max ^0, max, c ^,(hi<i)\fi ~ Ci/l ) ^ or ensurm g 
the lemma. On the other hand, if h > j, h is not fully-connected since t < a 7 ^ , 
so we set the contribution to max(0, t — c^), i.e. max(0, Ujj — Cj/i). From the 
execution of the algorithm, at any time, the sum of these contributions will not 
exceed the facility's opening cost at site i, hence the lemma follows. 

Lemma 13. For any site i and clients h, j s.t. 1 < h < j < ki : ru = Tj = r, 
then < u>lj + Cij + c ih . 

Proof. At time t — Uij — e, if all facilities at site i are already open, then Uij < Cij 
and the lemmas holds. Otherwise, if not all facilities are open, then at time t 
every client h < j is fully-connected. This is because Uih < implies = a l £ h 
or a^ h at the time t. Since h can only connect to less than Ri facilities at i, this 
contradicts the condition x\h = Ri f° r the setting of , so = a^ h . In 
addition, j itself is not fully-connected at t, whereas h is fully-connected and 
has already connected to r facilities. There is at least a facility to which h is 
connected but not j. (This is where we must enforce all clients have the uniform 
connection r.) Denote this facility (site) by i' 7 we have Uij < Ci>j and uj l h ■ > Ci'h- 
Lastly, by the triangle inequality of the metric property, c^j < c^h + + 
and then we have the lemma. 

Ek 

Zk — maximize — = 

/ + E*=id3 

subject to VI < j < k : cij < Oj+i 

VI < h < j < k : r h ,j > r h j + i 

VI < h < j < k : otj < rh,j + dh + dj 

3-1 k 

1 < j < k max ( r h,j - dh, 0) + max (ay - d h , 0) < / 

h=l h=j 

1 < h < j < k : ay, dj, f, r hd > 

Consider the above factor-revealing program series (25) of [TT]. If we let k = 
ki t a>j — Uij,rh,j — u}\j, f — fi, dj — c^, from the previous property lem- 
mas it is clear that Uij, u\j, fi and constitute a feasible solution. Also, 
from Lemma 5.4 and Theorem 8.3 of [TT], and Lemma 4 of [22] we can directly 
get Vie J " 1.61c«) < 1.61/,, Y*Li " 1-78^) < 1.11/, and 

2j=i ( u ij ~ % c ij) — fi- Furthermore, because m — \Ai\ and ki represents the 
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number of any subset of the clients, Lemma [6] and Corollary [2] directly lead to 
the following theorem. 

Theorem 3. Algorithm PD is 1.61-, (1.11, 1.78)- and (1,2) -approximation in 
time O (n 3 maxj g e r j) f or the uniform FTRA. 



Algorithm 3 SPD: Speed-up of Primal-Dual Algorithm 
Input: F, C, f, c, r, R. Output: (x, y). 

Initialization: Set U = C, Vi G F, j G C : Xij, y, = 0, FCj = 0. 
while W / 8, increase time t uniformly and execute the events below: 

- Event 1: 3i G F, j G U s.t. t = dj and Xij < yi. 
Action 1-a: ToC <s— min (j/; — x^, rj — FCj); 
Action 1-b: set Xij 4- Xij + ToC and FCj «- FCj + ToC; 
Action 1-c: If FCj = r 3 then U <- U\ {j}. 



- Event 2: 3i G F: J2 je u max (°> * _ +I^ 3 gc\w max (°> max i'e.F&&a Vj >o <Vj - 




/i and y z < Ri. 

Action 2-a: U t {j £ U \t - dj > 0} and AC <- min jeWi (rj - FCj); 

Action 2-b: Si 4- jj G C\U\ maXii e jr bScXi , j>0 c V] - dj > 0}, Vj G Si : i* «- 

argmax i ' e jr &&a ,., ; . >0 G i j and iVS «- minxes, x^; 

Action 2-c: set ToC «- min (AC, NS, Ri - yi) and y t <- j/j + ToC; Vj G : 
Kj^j <s— — ToC and iy <— Xij + ToC; Vj G : do Action 1-b; 
Action 2-d: Vj G Ui : do Action 1-c. 

Remark 2 .For £/ie convenience of analysis, sequential actions of the events are sepa- 
rated as above. If more than one event happens at the same time, the algorithm process 
Event 2 first so that no repeated events are needed. 



The previous PD algorithm runs in pseudo-polynomial time depending on rj . 
With the speed-up heuristic described in the following, the algorithm can then 
change to an essentially identical algorithm SPD (Algorithm 3) which is strongly 
polynomial. In the heuristic, (x, y) is able to increase at a faster rate rather 
than 1, through combining the repeated events into a single event and thereby 
reducing the total number of events to process. In particular, for Event 2, once a 
facility of a site i is opened and connected with a group of clients' ports, according 
to the PD algorithm, additional facilities at i may subsequently open and connect 
with this group of clients' other ports until their sum of contributions (SOC) 
becomes insufficient to pay f i , or j/j = Ri . The SOC is not enough any more if a 
client in U appears to be fully-connected, so Yljeu max (0; * — c »i) wm decrease, 
or the most expensive connection of a client in C\U differs (after switching), in 
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this case J2jec\u max (0j max i'e.7 r &&x j /. j >o Ci'j — Cijjjjwill decrease. Similarly, 
for Event 1, once a client j's port starts to connect to an already opened facility 
at a site i, its other ports may get connected to i at the same time until either 
there are no remaining open facilities at i or j reaches rj connections. 

Formally in the SPD Algorithm. FCj denotes the number of established 
connections of client j and ToC the total number of connections decided to 
make according to the heuristic. The incremental rate of (x, y) can then be 
determined by ToC instead of 1. Moreover, in the more complicated Event 2 on 
a site i, NC denotes the maximum number of connections to make until one of 
the clients in IA gets fully-connected, and N S the maximum number of switches 
until the most expensive connection of a client in C\U changes. Therefore, ToC 
is calculated as min(jVC, NS, R4 — y,), the maximum number of connections 
until the SOC becomes insufficient or yi = E4. 

Lemma 14. Algorithm SPD computes a feasible solution to the FTRA and runs 
in O (71 4 ) . 

Proof. The solution is feasible because SPD is essentially the same as PD ex- 
cept the implementation of the speed-up heuristic. Also with this heuristic, the 
number of Event 1 is at most nfn c because for any client j and site i only when 
t = Cij, j exhaustively gets connected to open facilities at site i, and there are 
rif sites and n c clients in total. So the numbers of both Action 1-a and 1-b are 
bounded by n/n c , while Action 1-c is bounded by n c since there are n c clients 
to be connected. 

Moreover, the number of Event 2 can be bounded by n c n f instead of X^jp jf R-i ■ 
This is because each Event 2 will cause at least one of the following 3 cases: (1) 
a client j in U becomes fully-connected; (2) a client j in C\U switches all of 
its most expensive connections; (3) a site open all its facilities. It is easy to see 
that there are maximum n c and n/ cases (1) and (3) respectively, so we are 
left to bound the number of case (2). For this case, it is important to observe 
that any client j has at most n / possible sets of connections where connections 
in each set associate to the same site. So there are at most n c nf such possible 
sets in total, and each case (2) at least removes a possible set, i.e. at least a 
client's connections have one less possible site to further switch to, since clients 
only switch to cheaper connections. Therefore, there are at most n c n/ case (2) 
and (n c + Uf + n c rif) Event 2. So the numbers of Action 2-a, 2-b and 2-c are 
bounded by O {n c nj) while Action 2-d is included in Action 1-c. Lastly, as in the 
PD algorithm, the switching action Action 2-c dominates the time complexity 

of all actions which takes O (n c nf), hence the total time is O ^c 71 /) ■ 

The algorithm computes the same solution as PD, so we obtain the following 
theorem. 

2 For simplicity of the algorithm description, we replace the term 
^2jec\u max max ? c 0(j(<!))j ~ c »i) m t ne PD algorithm with essentially the 
same term here. 
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Theorem 4. Algorithm SPD is 1.61-, (1.11, 1.78)- and (1 ,2) ] - approximation in 
time O (n 4 ) for the uniform FTRA. 



Algorithm 4 SGA: Speed-up of Greedy Augmentation 

Input: !F, C, f, c, r, R, (as, y). Output: (a;, y). 
Initialization: 

for j G C / /optimize the total connection cost first 

for i 6 T and y, > 0, in the increasing order of distances w.r.t j 
Xij i- min (rj, yi) 

r j *~ r 3 ~ x ij 

set residual vector y <— R — y / /for detecting the case yi reaches Ri 
set CC <r- X^jgc Ci o x ij as the current total connection cost 

invoke calculateGain 

while maxigjr gain (i) > 0: //if gain(i) > 0, then yi > from the calculateGain 
function 

• i •* qain(i) 

pick i = argmaxigjr K 1 

Si <- |j G C | max j / e ^ &&a . 4 ^ >0 c i /j - cej > o| 
Vj G 5i : i* <- argmax i ' e jr &fex .,_. >0 c i /j 
iVS <s— min_, e s i Xjtj, ToC <— min (NS, yi) 
set j/i* ^— yi* + ToC 

A <— //Z\ stores the total decrease in the connection cost after all switches 
for j G Si 

A «- zA + ToC ■ (maxj/gjjifa^jo - Cj»j) 

set a?i* j «— x t *j — ToC and «— + ToC 
set CC ^CC - A 
update y 

invoke calculateGain 

function calculateGain 
for i 6 J 

d CC //for each i, C; stores the total connection cost after connections 
are switched to i 
gain (i) ^— 
if Hi > 

for j G C 

if maxi'ejr&&a i , >o <Vj > cy 

Ci <— Ci — max i / e jr && , Cj >0 c^/j + cy 

gain (i) <— CC — d — /» 



In order to further achieve the factor of 1.52 in strongly polynomial time 
that matches the best result [25 for the uniform FTFL, it is necessary to apply 
the cost scaling and greedy augmentation (GA) techniques |25I8| for FTFL to 
FTRA. However, like in [18 27 the difficulty encountered is the application of 
greedy augmentation (GA) in polynomial time, since the naive way of treat- 
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ing an FTRA/FTRAoo instance as an equivalent FTFL instance and then di- 
rectly applying GA after cost scaling will result in weakly polynomial or pseudo- 
polynomial time algorithms, depending on whether using the instance shrinking 
technique in the previous section or not. 

Nevertheless, if GA is applied with another similar speed-up heuristic, it 
changes to the algorithm SGA (Algorithm 4) which runs in strongly polynomial 
time. Before describing SGA, we take a brief look at GA in [5] for FTFL. It 
defines gain (i) of a facility i to be the decrease in total cost (decrease in total 
connection cost minus increase in facility cost of i) of the solution after adding 
a facility i to open and connecting clients to their closest facilities. Note that 
once a set of open facilities are fixed, the total connection cost can be easily 
computed since every client simply chooses these facilities in increasing order of 
distance. GA then iteratively picks the facility with the largest gain ratio ga "M 
to open until there is no facility i with gain (i) > left. On the other hand, SGA 
computes gain (i) in the same way as GA. The difference is in FTRA there are 
~}2 ie jr Ri facilities in total, therefore it is slow to consider one facility at a time (in 
each iteration of SGA). Fortunately, there is also a speed-up heuristic: because 
all facilities at a site i has gain(i), once a facility at site i m with maxj ga "M 
is selected to open, additional facilities at i m may also open at the same time 
until either (1) this maximum gain(i m ) reduces due to insufficient decrease in 
the total connection cost; or (2) yi reaches Ri. Moreover, (1) happens once a 
client has appeared to switch all of its most expensive connections to i m . which 
is similar to the switching case in the previous algorithm SPD. 

Formally in the SGA algorithm, CC denotes the current total connection cost 
and Cj the connection cost after i is opened and client connections are switched. 
The calculateGain function computes gain (i) and the while loop implements 
GA with the described heuristic. In each loop iteration, for updating CC, A 
stores the total decrease in the connection cost after client switching. Following 
the heuristic, ToC and NS are defined similarly as in the SPD algorithm. Note 
that in the initialization phase of SGA, the total connection cost is optimized 
first so that every client connects to its closest facilities. This is to ensure that 
in every iteration only the client connections with the largest costs need to be 
considered in computing the best possible connection cost Ci. 

Lemma 15. Algorithm SGA runs in O (n 4 ) for FTRA. 



Proof. Each iteration of the while loop runs in O (n c nj) due to the calculateGain 
function. Now, we bound the the total number of iterations. Similar to the run- 
time analysis of the algorithm SPD (c.f. Lemma 14 1, in SGA once a site i m with 
the maximum gain is chosen, SGA opens the facilities at i m until either R im is 
reached, or a client has appeared to switch all of its most expensive connections, 
causing reduced maximum gain. Further, there are at most nt chances to reach 
Ri m and n c nj possible sets of connections for all clients. Since clients also only 
switch to cheaper connections, there are maximum (n/ + n c fif) iterations. The 

total time is therefore O (ji^.njj . 
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Now the important observation/trick for the analysis is that applying SGA to an 
FTRA/FTRA^ instance (with solution) obtains essentially the same solution 
(also the same cost) as treating this instance as an equivalent FTFL instance 
(by naively splitting sites) and then directly applying GA. The difference is, with 
the speed-up heuristic, SGA is able to arrive at this solution faster, in strongly 
polynomial time. The observation then implies that SGA alone improves the 
3.16-approximation result of [27] for the general FTRAoo to 2.408 in polynomial 
time using the GA results [5J for FTFL. Similarly, for the uniform FTRA, SGA 
combined with cost scaling further improves on the (1.11, 1.78)-approximation 
algorithm SPD according to the results of [25 for the uniform FTFL. 

Theorem 5. The uniform FTRA can be approximated with a factor of 1.52 in 
time O (n 4 ) . 

6 The Uniform KFTRA 

Lastly, we consider the Constrained Fault- Tolerant fc-Resource Allocation (KFTRA) 
problem and show its uniform case achieves an approximation ratio of 4. In this 
important variant of FTRA, there is an additional constraint that at most k 
facilities (max^gc r j < k an d k < X^g.F across the sites can be opened as 
resources. This problem has the following formulation. 

minimize J2 ie r fiVi + £ie.F EjeC °io x ij 
subject to V j G C : X^gjf x ij — r j 

Mi G T,3 G C : yi - x io > 

Vi < k 
Mi G T : yi < //, 
Vi G T , j G C : , yi G Z + 

Its LP-relaxation (primal LP) and dual LP are: 

minimize Y^ieT fiVi + E ie ^ EjeC c v x v 
subject to Mj G C : J2 ie jr > rj 

yi G T , j G C : yi~ x-ij > 

J2ie^ Vi ^ k 
Mi G J 7 : yi< Ri 
Mi G J 7 , j G C : x^, yi > 

maximize Ejec r i a i — Eie-F-^ z * — ^0 
subject to Vi G J 7 : X^ e c /% ^ /» + z i + ^ 

Mi E F,j E C : ctj — Pij < Cij 

Mi G T,j G C : Oj, ftj, Zj, > 

It is clear that KFTRA generalizes the well studied KUFL [ PUTT] and 
KFTFL problems. In the following, besides adapting the algorithms and 
analyses therein, we also develop a greedy pairing (GP) procedure which in 



(4) 



(5) 



(6) 
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polynomial time constructs paired and unpaired sets of facilities from sites for 
randomly opening them afterwards. 

Algorithm Description. The algorithm PK (Algorithm 5) consists of three 
sequential procedures: Binary Search (BS), Greedy Pairing (GP) and Random- 
ized Rounding (RR). BS utilizes the previous (1, 2)-approximation algorithm 
SPD (Algorithm 3) for FTRA with the modified input facility cost 2 (/* + 9), 
i.e. the cost is increased by 9 first and then scaled by 2. As we will see later in the 
analysis, this modification is necessary for two reasons: 1) the Lagrangian relax- 
ation of KFTRA is FTRA; 2) the scaling of the facility cost enables us to build 
on the approximation ratio (1, 2) of FTRA for getting the ratio of KFTRA. For 
simplicity, let SPD (6*, A) denote the parameterized SPD algorithm with the in- 
put facility cost perturbing factor 9 and scaling factor A, so SPD(0, 1) produces 
the same solution as SPD. From LP ([lj and it is clear that SPD produces 
an almost feasible integral solution to KFTRA except that it has to guarantee 
at most k facilities to open (^2 ie jryi < k) from all sites. This guarantee might 
not be even possible, but fortunately we can use SPD(#, A) to get two solutions 
(x s , y s ) and {xi 1 yi) with the small one having ^2 ie jry s .i = k s < k facilities 
open and the large ^2 ie jrVi.i = fc; > k. A convex combination of these two so- 
lutions is able to give a feasible fractional solution (a/, y') to KFTRA instead, 
i.e. (a/, y') — a (x s , y s ) + b (xi, y{) with a + b = 1 and ak s + bki — k. The so- 
lutions can be obtained by binary searching two values of 9 (6\ and 62) over the 
interval [0, naC ™ a * ] where c max — maxigjr^gc Cij and invoking SPD(#i, A) and 
SPD (#2, A). This specific interval is chosen because as the value of 9 increases, 
the number of open facilities from SPD (9, A) will decrease. So if 9 = TIcC r' ax , the 
algorithm will only open the minimum number of max je c Tj facilitiesj^Moreover, 
as shown later, if 9\ and 9i become sufficiently close (e = where c m i n is the 
smallest positive connection cost and N = ^2 ie jr Ri) in BS, the approximation 
ratio of SPD is almost preserved for building a ratio for KFTRA. 

However, for a feasible integral solution (a;, y) with k open facilities, the 
algorithm instead relies our efficient GP procedure. Based on the solution vectors 
y s and yi obtained from BS, GP splits the vector yi into y p and y p s.t. yi = 
Up + Vp an d Ylie t Vs.i — ^Zi^jr y P .i — k s . Note that each of these integral vectors 
represents the facility opening amounts of all sites. To be precise, GP greedily 
constructs the paired (y p ) and unpaired facilities (y~ p ) from yi against the small 
solution y s . In particular, it first pairs the facilities of the corresponding sites 
in y s and yi (both sites with open facilities) and records the pairing result in 
y p . Next, for each left unpaired site i in y s in arbitrary order, GP exhaustively 
pairs the facilities at i with the facilities of the unpaired sites in yi in the order 
of closest to i. In this pairing step, y p is updated accordingly. At the end, y p 
is simply set to be yi — y p . To be more precise, we consider a simple example 
with y s = [3, 2, 0, 2] and yi = [2, 0, 5, 3] from BS before running GP. After the 

3 We noticed that the binary search interval [0, nrc max \ (c.f. the third paragraph of 
Section 7 of |25| ) for KFTFL can be reduced to [0, ] , because once the 

minimum number of maxjgc i"j facilities are opened and all facility costs are at least 
n c c ma x, from the primal-dual algorithm, all clients are already fully-connected. 
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Algorithm 5 PK: Procedures for KFTRA 

Input: A KFTRA instance (F, C, fi, Cij, rj, Ri, k). Output: (x, y) 
Initialization: x <— 0, y ^— 
Procedure 1: Binary Search (BS) 

0! = 0, 6 2 = "cm^^.jee^. 

while 02 — Oi > e do: 

mid = 

invoke SPD with (F, C, 2 (/, + mid) , dj, rj, Ri) and output (x mid , y m id) 

kmid ^^,i£jr Umid,i 
if kmid < 

set 02 = mid 
else if fc mld > fc 
set 9\ = mid 

else 

return mid / /if here reached, all procedures afterwards can be safely ignored 
invoke SPD with (F, C, 2 (fi + 9i) , Cij, rj, Ri) and output (xi, yi) 
invoke SPD with (F, C, 2 (/, + 62) , Cij, rj, Ri) and output (x s , y s ) 

Procedure 2: Greedy Pairing (GP) 

/ / vectors representing numbers of constructed paired and unpaired facilities in yi 
set y v 4- 0, y p 4- 
for i € F 

if y s ,% > and yi^ > 

2/p,< <- y P ,i + min(j/ Sli , y tii ) 

//updating vectors and store them in y B and yi for the next pairing steps 
y2,i <— y s ,i — min (y s ,i, yi,i) 
yi.i *- yi,i ~ min (j/ S]i , y M ) 
for i £ in arbitrary order 
if > 

for i' G J r \i in the order of closest to i 
if ift> > 

y P y *- y P ,i> + min (y2,i, yfy) 
Va,i 4- yli - min (y' s ,i, yfy) 
yfy 4- yfy - min (ylt, yfy) 
Vp 4- Vi - Dp I / at this tim e £) ieJ r fp.' = fcs and E;e^ fi>.» = k t - k s 
Procedure 3: Randomized Rounding (RR) 

choose probabilities a = fc fc /J/ fc fc and & = £^-k so afc s + bfc; = fc and a + b = 1 

set y i — y s with probability a and 1/ y p with probability fe = 1 — a //disjoint cases 

both open k s facilities 

select a random subset of k — k s facilities to open from y v and add these to y / /at 
this time ~^2 ie:F yi = k and each facility in y t is opened with probability b 
//connects each client j to its closest rj opened facilities 
for j G C 

for i G F in the order of closest to j 
Xij 4- min(rj, yi) 
rj <— rj — Xij 
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first pairing step, y p = [2, 0, 0, 2], y s — [1, 2, 0, 0] and yi = [0, 0, 5, 1]. Now 
for simplicity, we assume that the distance between sites i and j is \i — j\ where 
we follow the ascending order of indices j's in resolving the ties of the closest 
distances. Therefore, after the second step, y p = [2, 0, 3, 2], y s = [0, 0, 0, 0] and 
Vi — Hp = [0, 0, 2, 1], since both the unpaired y^i and 1/^2 (1-based index) are 
paired to the closest unpaired 

Based on the y s . y p and y p obtained, the last procedure RR then randomly 
opens k facilities in a way that ensures the expected facility opening cost in y is 
the same as the cost of the opening facilities in the convex combination solution 
y'. Finally, according to y, RR connects each client j to its closest rj opened 
facilities via updating x. 

Algorithm Analysis. The basic idea of the analysis is to first bound cost (x' , y') 
by cost (a' , f3' , z' , 8') where (a' , /3' , z' , 9') is a feasible dual solution to LP 
Then we bound the expected total cost cost (x. y) with cost (x', y') to further 
establish the approximation ratio p s.t. cost(x, y) < pcost(a', (3', z',9'). Fi- 
nally, by the weak duality theorem, cost (x, y) < pcost (a;^, y^j where (x^, y£) 
is the optimal fractional solution to KFTRA (displayed as LP (|5|). 

For the first step, we focus on analyzing the BS procedure to bound cost (a;', y') 
by cost (ex', (3' , z',6'). Suppose SPD(#, 2) produces the primal solution (x, y) 
with k open facilities. We let the cost of (x, y) w.r.t. the original input instance 

be cost (x, y) = F+C, where in the separate costs (^F, Cj, F — J^ier 1S ^ ne 
total facility cost and C = X^e-rSjec c ij x ~ij ^ s the connection cost. Similarly, 
w.r.t. the modified instance, the cost is cost' (x, y) = 2 (f + kO) + C. From the 
analysis (cf . the paragraph before Theorem |3| of the factor revealing program of 
the PD algorithm, for SPD((9, 2), we get Vi e T : T, je c ( u ij ~ 2c u) < 2 (/* + °) 
where C C C, i.e., 

^d,-2^ CjJ -z~-2(/ 4 + 0)<O (7) 
jec jec 

where (^a, /3, z^j is the corresponding constructed dual values of (x, y) from 

the PD algorithm. Further, from Lemma [7] we have a bound for cost' (x, y), 
i.e., 

2 (P + k9 \ + C < r i d i - H Ri£i ( 8 ) 
jec i&r 

. Note that the dual solution ^a, /3, z^j is used only in the analysis. Also, because 
SPD only speeds up PD by combining its events, we can use the dual solution 
produced from PD for analyzing SPD. If we set [a' , z' , = |. 6) and 

Vj £ J, j £ C : /?y = a'j — Cjj, the inequality (7) then becomes J2jec @ij — 
fi + z[ + §', implying (a', /3', z', 6'j is a feasible dual solution to LP Fur- 
thermore, (8) becomes 
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2F + C < 2 E ~ E ^ ~ U ' ( 9 ) 

. The analysis here reveals the Lagrangian relation between KFTRA and FTRA 
from the dual perspective, whereas the Lagrangian relaxation framework (cf. 
Section 3.6 of [13]) starts from the primal. Therefore, if k = k, (x, y) is 2- 
approximation from the inequality (9), the bound cost (x, y) < 2F + C and the 

feasibilities of (x, y) and (a. 1 , (3' , z' , 6'^j . However, as mentioned before, we may 

never encounter the situation k = k. Instead, the BS procedure finds Q\ and O2 
until 8 2 — Oi < e — fpr- It then runs SPD(0i, 2) to obtain the solution (xi, y{) 
with ki > k and the cost (Fi, C/) w.r.t. the original instance; and SPD(#2, 2) to 
get the solution (x s , y s ) with k s < k and (F s , C s ). Hence, from (9) we have 

2F t + a < 2 ( e - E M< - m (10) 

\jec ieT J 

and 



2F S + C s < 2 ( ^ r ja ' s<j - E Ri^i - M2 | (11) 

where (aj, /3 ; ', 2[) and (a^, j3' g , z' s ^ are constructed as [a', f3' , z'J to be 
feasible duals. 

Now we are ready to bound cost (a/, y') by cost (a', /3', z' , 0'). The proof of 
the following lemma builds on the idea of Lemma 9 in [14_ for fc-median. 

Lemma 16. cost(x',y') < (2 + ^) F' + C < (2 + ^) cost (a', /3', z', 6'), 
where N = X« g .f Ri, (x f , y') = a (x s , y s ) + b (x t , y t ), a + b = 1, k = ak s + bk h 

F' = EierMi = aF s + bF h C = £ i£F E JgC cya^ = aC * + hC ^ a ' = 
aa' s + ba' v (3' = a/3' s +b/3' l , z' = az' s + bz[ and9' = 6 2 . Moreover, (a', (3' , z',9') 
is a feasible dual solution to the KFTRA problem. 

Proof. From the constructions of (aj, f3' v zfy and (ot g , f3' a , z'\, we get Vi G F : 

E je e ft,* - f- ■ << + and Ejec < fi + <,i + ^ then E ieC P'ij < 
fi + z\ + 062 + b0\ < fi + z[ + 62 after multiplying the first inequality by a, the 

second by b and adding them together. In addition, with the setting Vi G F, j G 

C : = a'j — Cij, we get the feasibility of (a', f3', z',9') to LP dfjj. Next, we 

aim to derive the following bound 



2 + iy<+Q< (2 + I) (SH-EMr^) (12) 




from the inequality (10). For now, suppose this bound holds, from (11), we 
have 
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. After multiplying (12) by &, (13) by a and adding them together, we get 

2+1)f' + C'<(2+1) (5>a$-5>4-*M (14) 



. This then yields the lemma together with the feasibility of (a' , f3', z', 9') and 
cost (a/, y') = F' + C . The last thing left is to verify in the following that (12) 
indeed holds from the inequality (10), the termination condition of the algorithm 
6 2 - 9i < e = fftf and the fact that C; = J^itrl^jec c ij x i,ij ^ c ™«- 



Ci <2 | 
























For simplicity, let A = fj^jeC r i Q! i j — 



AN 2 



AN 2 



] ieJr Riz' hi - ki8 2 - Fij . Because 
ki < N and N > 1, we get C; < — \— A < (2 + i) A. Hence, the inequality 

1 4JV 2 

(12) is verified. 

For runtime, our BS procedure totally makes O (L + log N + log n) probes (L is 
the number of bits of the input costs) over the interval [0, ^mum j un til the in- 
terval becomes the size of Moreover, each probe takes O (n 4 ) to invoke the 
SPD algorithm, so the total time is O (n 4 (L + log N + log n)) which dominates 
the overall runtime of the algorithm PK. 

For the next step, we focus on analyzing the GP and RR procedures to bound 
cost (x, y) = F + C with cost (x f , y') = F' + C. Note that F and C are the 
expected total facility and connection costs respectively from the randomized 
procedure RR. This procedure can also be derandomized using the method of 
conditional expectation as in |14| for the fc-median problem. In the following, we 
bound F with F' and C with C separately. With probability 1 , RR opens exactly 
k facilities. Specifically, it randomly opens each facility in y p with probability 6, 
and each facility in y p with probability ^~^f which is also b. Since GP properly 
splits the vector yi into y p and y p s.t. yi = y p + y p , we can conclude that 
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each facility in yi is opened with probability b. In addition. RR randomly opens 
each facility in y s with probability a, therefore the total expected opening cost 
is aF s + bFi which is F' . 

Lemma 17. The total expected facility opening cost F satisfies F = F' . 

Next, we bound C with C . Suppose two FTRA instances with the solu- 
tions (x s , y s ) and (xi, yi) are produced from the BS procedure. Afterwards, 
for getting a feasible solution to KFTRA from these solutions, instead we con- 
sider a naive pseudo-polynomial time algorithm. The algorithm first treats the 
FTRA instances with the solutions (x 3 , y s ) and (xi, yi) as equivalent FTFL 
instances (by naively splitting sites and keeping the clients unchanged) with 
the transformed solutions (x s , y~ s ) and (xi, yi J] respectively. Then, it uses the 
matching and rounding procedures (cf. the paragraph before Lemma 7.1 in |25| ) 
on (x s , y s ) and (xi, yi) to get a feasible solution (x, y) to KFTFL. Finally, 
the solution (x, y) can be easily transformed to a feasible solution (x, y) to 
KFTRA. Now, the important observation is that directly applying GP and RR 
to these FTRA instances with the solutions (x s , y s ) and (xi, yi) from BS ob- 
tains essentially the same solution (x, y) (also the same cost) to KFTRA as 
the naive algorithm does. It is mainly because for the FTRA instances of size 
O (n), our designed GP procedure pairs the integer vectors in polynomial time. 
This is the speed-up of the matching procedure therein [55] applied to the equiv- 
alent FTFL instances of size O (^2i £ jr Ri) ■ Therefore, only in the analysis, we 
can consider the naive algorithm instead to get the following bound for C. This 
analysis trick is similar to the trick used for analyzing the algorithm SGA (cf. 
the paragraph before Theorem [5]). 

Lemma 18. The total expected connection cost C satisfies C < (1 + max (a, b)) C . 



Proof. For the equivalent FTFL instances, we let T' be the set of split facilities 
with size X^gjf ^ anc ^ use ^ t° index these facilities. After the matching and 
rounding procedures in |25| on the transformed solutions {x s , y s ) and (xi, y{), 
we get the solution (x, y) to KFTFL. Also from its Lemma 7.2, we can di- 
rectly obtain the bound Cj < (1 + max (a, b)) X^fcejr' °kj {a>Xs,kj + bxi'kj^where 
Cj = ^2f.pjr/ Ckjx'kj, i.e. the expected connection cost of any client j. Since 
(x s , y s ), (xi, y t ) and (x, y) are transformed from (x s , y s ), (x t , y t ) and (afe, y) 
respectively with the same costs, we have 



4 W.l.o.g., the solutions can be easily transformed between FTRA and FTFL as 
shown in Theorem 7 of [18J. 

5 Note that we get this tighter bound coefficient (1 + max (a, b)) (rather than 2) 
through a bit more careful analysis of the lemma therein. 
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C — CijXij — CkjX'kj 

ieT jec jeCkeT' 

< (1 + max (a, b)) ^ X! Ck i ( ax ^j + bxi' kj ) 
jec kef 

= (1 + max (a, 6)) 2J X! c ^ (^Mi + ^Mi) 

i&F jec 
= (I + max (a, b))C" 



which concludes the lemma. 



Adding up the separate bounds in the previous two lemmas, we get cost (x, y) - 
F + C < F f + (1 + max (a, b)) C. Relating this bound to the bound (2 + jf)F'- 
C < (2 + cost (a', f3' , z', 9') in Lemma 16 we obtain 



cost (x, y) < F' + (1 + max (a, b)) C 

< ( 2 +^) (l + max(a,6))F' + (l + max (a, b)) C' 

< ^2 + (1 + max (a, 6)) cost (a', /3', z\ 9') 

< Acost{cx', f3', z',9') 

. The last inequality is from the fact that a = < 1 — -L (achieved when 

hi = N and fc s = k — 1), and = ^ ~ 1° < 1 — \ (achieved when fc; = fc + 1 and 
k s = 1). Therefore, l + max(a, b) < 2-i and (2 + i) (1 + max (a, 6)) < 
By the weak duality theorem, the approximation ratio is 4. For runtime, from 
the algorithm PK, both GP and RR take O (n 2 ). 

Theorem 6. Algorithm PK is 4- approximation for the uniform KFTRA in 
polynomial time O (n 4 (L + log N + log n)) . 



7 Concluding Remarks 

In this paper, we studied the Constrained Fault- Tolerant Resource Allocation 
(FTRA) problem and its important variant Constrained Fault-Tolerant fc-Resource 
Allocation [KFTRA) problem. In particular, although the Fault- Tolerant Facil- 
ity Location (FTFL) problem is a special case of FTRA, we have shown that 
it is not much harder (in terms of the attained approximation ratios) to ap- 
proximate FTRA than FTFL for both the general and the uniform cases. The 
counterparts of FTRA is the Unconstrained Fault-Tolerant Resource Allocation 
(FTRAoe) problem. This problem was recently claimed in [55] to be easier to 
approximate than FTFL and slightly harder than the classical Uncapacitated 
Facility Location (UFL) problem. 
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From the practical side, our developed resource allocation models inherited 
from FTFL and UFL are more general and applicable than these classical 
models. Therefore, in future, it is worth looking at these models' other important 
variants such as the capacitated variant in |18| , the Reliable Resource Allocation 
(RRA) problem in [TO] and etc. From the theoretical side, two grand challenges 
still remain today: 1) close the approximation gap between FTFL (1.7245) and 
UFL (1.488) or show FTFL is more difficult than UFL; 2) reduce the ratio of 
1.488 to the established lower bound 1.463 for UFL. 
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